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GENERALIZING SIMES' TEST AND HOCHBERG'S 
STEPUP PROCEDURE 1 

By Sanat K. Sarkar 

Temple University 

In a multiple testing problem where one is willing to tolerate a 
few false rejections, procedure controlling the familywise error rate 
(FWER) can potentially be improved in terms of its ability to detect 
false null hypotheses by generalizing it to control the fc-FWER, the 
probability of falsely rejecting at least fc null hypotheses, for some 
fixed fc > 1. Simes' test for testing the intersection null hypothesis 
is generalized to control the fc-FWER weakly, that is, under the in- 
tersection null hypothesis, and Hochberg's stepup procedure for si- 
multaneous testing of the individual null hypotheses is generalized 
to control the fc-FWER strongly, that is, under any configuration of 
the true and false null hypotheses. The proposed generalizations are 
developed utilizing joint null distributions of the fc-dimensional sub- 
sets of the p-values, assumed to be identical. The generalized Simes' 
test is proved to control the fc-FWER weakly under the multivariate 
totally positive of order two (MTP2) condition [J. Multivariate Anal- 
ysts 10 (1980) 467-498] of the joint null distribution of the p- values 
by generalizing the original Simes' inequality. It is more powerful to 
detect k or more false null hypotheses than the original Simes' test 
when the p- values are independent. A stepdown procedure strongly 
controlling the fc-FWER, a version of generalized Holm's procedure 
that is different from and more powerful than [Ann. Statist. 33 (2005) 
1138-1154] with independent p- values, is derived before proposing the 
generalized Hochberg's procedure. The strong control of the fc-FWER 
for the generalized Hochberg's procedure is established in situations 
where the generalized Simes' test is known to control its fc-FWER 
weakly. 
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1. Introduction. Given a collection of null hypotheses and the corre- 
sponding p- values in multiple testing, one encounters two types of problem: 
(i) global testing of the intersection null hypothesis and (ii) simultaneous 
testing of the individual null hypotheses. Traditionally, one seeks to control 
the probability of falsely rejecting the intersection null hypothesis, the global 
Type I error rate, in global testing and the probability of rejecting at least 
one true null hypothesis, the familywise (Type I) error rate (FWER), in 
simultaneous testing. As a global test, Simes' [19] test has received consid- 
erable attention. With the p-values marginally having uniform distributions 
on (0, 1) under the null hypotheses, it controls the global Type I error rate at 
the desired level exactly under independence [19] and conservatively under 
the multivariate totally positive of order two (MTP2) condition of Karlin 
and Rinott [10] shared by commonly encountered multivariate distributions 
in multiple testing [15, 18]. A careful study of the proof in [15], of course, 
reveals that the conservativeness of Simes' test actually holds for a larger 
class of distributions satisfying the positive regression dependence on sub- 
set (PRDS) condition; see [1, 16] from which this conservativeness under 
the PRDS condition also follows. For simultaneous testing, Hochberg's [5] 
procedure has also received considerable attention. It is a stepup procedure 
based on the same critical values as those of the stepdown procedure of 
Holm [7] and controls the FWER in the same situations where Simes' test 
controls its global Type I error rate. 

Recently, the notion of FWER has been generalized to that of the fc-FWER, 
the probability of rejecting at least k true null hypotheses. It is argued that 
in many situations, one is willing to tolerate a few false rejections but wants 
to control too many of them, say k or more. In such a case, a procedure 
controlling the fc-FWER, for some fixed k > 1, will have a better ability 
to detect k or more false null hypotheses than the corresponding FWER 
{k = 1) procedure. A number of such procedures have been proposed in the 
literature [3, 9, 11, 12, 13, 21]. Motivated by the increasing importance of 
the concept of fe-FWER and the scope of further strengthening some of the 
related procedures proposed in the literature under certain situations, we 
consider developing newer &-FWER procedures in this article. More specif- 
ically, we generalize both Simes' global and Hochberg's simultaneous tests. 

While Simes' test is basically a global test, it corresponds to a simulta- 
neous test with a control of the FWER in a weak sense, that is, under the 
intersection null hypothesis. Therefore, the notion of A;-FWER, which has 
been discussed so far primarily in the context of simultaneous testing, can 
be applied to Simes' global test as well. In other words, one could poten- 
tially improve the power of Simes' test to detect k or more false component 
null hypotheses by generalizing it to a global test that corresponds to a 
simultaneous test with a weak control of the fc-FWER. This would be par- 
ticularly useful in a situation where one is basically interested in testing the 
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intersection of a family of null hypotheses but likes to detect more false null 
hypotheses than just one in the family before declaring the whole family 
significant. 

For the original or generalized Simes' test, by simply saying that it controls 
the FWER or /c-FWER, we will mean that it does so weakly, that is, when 
the intersection null hypothesis is true. For a simultaneous test, of course, 
the control of the /c-FWER will be meant to be in the strong sense, that is, 
under any configuration of true and false null hypotheses, unless we state 
otherwise. 

Lehmann and Romano [12] gave two simultaneous tests that control the 
/c-FWER, a Bonferroni type single-step test and its Holm type stepdown 
improvement, each based on the marginal p- values. As noted in the present 
article, the stepup analog of this stepdown procedure can also control the 
/c-FWER in the same situations where the original Simes' test controls its 
FWER and thus provides a generalized version of Hochberg's stepup proce- 
dure. Romano and Shaikh [13] developed a general /c-FWER stepup proce- 
dure based on the marginal p-values. Our version of generalized Hochberg's 
procedure controlling the /c-FWER is, however, different. 

Based on some new results on probability distributions of ordered random 
variables established in this article, we notice that the notion of /c-FWER 
allows one to use the /c-dimensional joint null distributions of the p-values. 
Since p-values are often generated in multiple testing from test statistics 
that, under the null hypotheses, have a known distributional form, construct- 
ing /c-FWER procedures utilizing these /cth-order joint null distributions in 
an attempt to improve those relying on the marginal p-values appears to be 
a worthwhile objective. 

Let P{ be the p- value corresponding to the null hypothesis Hi, i = 1, . . . , n, 
and P\- n < • ■ • < P n - n be the ordered p-values with the corresponding null 
hypotheses -ff(i), • • • , H^ n y Assume that the p- values have identical /cth-order 
joint null distributions. Then, using Gk, the common c.d.f. of the maximum 
of any k of the n Pj's under the null hypotheses, we propose the following 
tests: 

The generalized Simes' test. Reject H = fir=i Hi if and only if 
(1.1) P i:n < a max ( i)fc ) for at least one i = 1, . . . , n, 

where a,i is given by 

, \ „ , , i(i-l)---(i-k+l) 

d' 2 ) g >W = ^n-l)-(,- * + !) "■ ' = * 

The generalized Hochberg's procedure. Reject Hu\ for i < i 
and accept Hu\ for i> i , where 

(1-3) i = max {i:P i:n <a max(ifc) } 

Ki<n 
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with a.i given by 

, . , fc(fc -!)•••! 

(1.4) Gfc(Qi)- (n _. + fc)(n _. + A ._ 1) ... (n _. + 1) "> 

i = fc, . . . ,n. 



With the p-values assumed to be distributed, under the null hypotheses, 
marginally as uniform on (0, 1) and jointly with a distribution that exhibits 
positive dependence in the sense of MTP2, we first generalize the original 
Simes' inequality [15, 18, 19]. Restricting to exchangeable MTP2 distribu- 
tions under the null hypotheses, we then prove that the generalized Simes' 
test controls the fc-FWER at a, exactly under the independence and conser- 
vatively otherwise. Assuming identical A;th-order joint null distributions, but 
no special dependence structure, of the p-values, we modify the Bonferroni 
type single-step procedure in [12] and then develop its Holm type stepdown 
improvement. The generalized Hochberg's procedure is then proposed as the 
stepup version of this generalized Holm's stepdown procedure. It is shown to 
control the fc-FWER in situations where the generalized Simes' test controls 
its /c-FWER. 

A generalized stepwise procedure for simultaneous testing that we are re- 
ferring to in this article is a regular stepwise procedure with its first k critical 
values being the same, for some fixed 1 < k < n. For instance, a generalized 
stepdown procedure with critical values a& < • • • < a n accepts any Hu\ for 
which i > mini<j< n {j : Pj- n > a max {j,fc}} if the- minimum exists, otherwise 
rejects all Huy Similarly, a generalized stepup procedure with these critical 
values rejects any for which i < maxi<j< n {j : Pj :n < oc max {j 

,k}} if the 

maximum exists, otherwise accepts all Huy When all these critical values 
are the same, a generalized stepwise procedure is simply a single-step pro- 
cedure. In the single-step procedure of Lehmann and Romano [12] with a 
control of the fc-FWER at a, 014 = ka/n; whereas, in their generalized Holm 
procedure, a.{ = ka/(n + k — i). 

In the next section, we provide the necessary background for understand- 
ing some important technical aspects of this paper. In Section 3, the gener- 
alized Simes' test is developed and the control of its fc-FWER is established 
by generalizing the original Simes' inequality. We consider the probability of 
rejecting k or more false null hypotheses as a measure of power, and compare 
our test with Simes' original test in terms of this. Our test is seen to be theo- 
retically more powerful under the independence, at least when 2 < k < 1/a. 
We conduct a numerical study by simulating normally distributed data to 
examine the extent of power improvement of our test over Simes' under 
the independence and how it changes as the p-values become more depen- 
dent. This study, whose findings are also reported in Section 3, reveals that 
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the generalized Simes' test becomes more powerful with increasing num- 
ber of false null hypotheses, quite significantly under the independence and 
moderately for small correlation. In Section 4, we develop our version of 
generalized Bonferroni single-step procedure and its Holm type stepdown 
improvement before deriving the generalized Hochberg's procedure. Again, 
the generalized Hochberg's procedure is seen to be theoretically a more pow- 
erful fc-FWER procedure than the stepup version of the Lehmann-Romano 
stepdown procedure when 2 < k < 1/a. Another numerical study is con- 
ducted to see how well it actually performs in terms of power compared 
to both the original Hochberg's procedure and the stepup version of the 
Lehmann-Romano procedure under the independence of the p- values, and 
how the power performance changes with increasing dependence among the 
p-values. The concept of average power, the expected proportion of false null 
hypotheses that are correctly rejected [2], is used as a measure of power in 
this numerical study. The generalized Hochberg's procedure is applied to a 
real data set in Section 5. The paper concludes with some additional remarks 
and discussions in Section 6. More specifically, we discuss based on further 
numerical investigations how to modify our procedures when the condition 
of identical fcth-order joint null distributions is not met, and how our pro- 
cedures would perform when the MTP2 condition is violated and when k is 
larger than 1/a. 

2. Preliminaries. First, we present a lemma giving a formula for the 
probability distribution involving the ordered components of any set of ran- 
dom variables, not necessarily p- values. This will play a key role in devel- 
oping the proposed methods in this article by providing explicit formulas 
for the probabilities of k or more rejections of null hypotheses in generalized 
stepup and single-step procedures. Second, we recall from [10], the definition 
of MTP2 distribution and some related results that will be used to establish 
the fc-FWER control of the proposed methods under positive dependence. 
Third, we give an example of distributions satisfying the two main conditions 
assumed in the paper — the MTP2 condition and the condition of identical 
fcth-order joint null distributions. 

2.1. Probability distributions of ordered random variables. Given a set 
of random variables X±, . . . ,X n , we denote the ordered components of the 
set Xj = {Xi, i S J} by X\ : j < ■ ■ ■ < X\j\.j when J C {1, . . . , n}, and by 
Xi. n ,...,X nln when J = {l,...,n}. Let C k = { J : J C {1, . . . , n}, \ J\ = k}, 
the collection of subsets of {1, ... ,71} of size k, and 1(A) be the indicator 
function of a set A. 
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Lemma 2.1. Given any set of constants — oo < < • • • < c n < c n +i = 
oo, for a fixed 1 < k < n, and ai = = k, . . . ,n, 

Prl\J(X r . n < Ci ) 
li=k 

(2.1) = a- 1 MXk-.j < c„} 

JGC fe 
n-1 

^(Xj) = Pr{X i:J c > c fc+i , . . .,X n _ k ,jc > c n |Xj}, 
i = 1, . . . ,n — k. 



When k = 1, Lemma 2.1 reduces to the key result of Sarkar [15] in terms 
of the marginal distributions of the X^s that he used to prove the conser- 
vativeness of the original Simes' test under positive dependence. The idea 
behind that proof is generalized using the /cth-order joint distributions of 
the AYs in the present proof of Lemma 2.1 given in the Appendix. 

Remark 2.1. It is important to note that, when AVs are p- values, the 
left-hand side of (2.1) is the probability of k or more rejections of null hy- 
potheses in a generalized stepup procedure, or, in particular, in a single-step 
procedure, with the Cj's as the critical values, and Lemma 2.1 provides an 
explicit formula for this probability. 

Remark 2.2. Lemma 2.1 can be simplified under certain specific dis- 
tributional structures. For instance, when (Xi, . . . ,X n ) is exchangeable, 



Pr\ [j(X r . n <c, 

Ki=k 

Fk{c n ) + (j^j^2 E i>i-k+i(Xi, . . . ,X fc ) ja" 1 /^ max Xi < c t 



n-1 



where 



F k (x) 



a i+ \/ max . Xi < c m 

\l<i<k 



Pr{ max X; < x 

Ki<k 
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and 



MX 



!)■•■) X n 



M4.n-k > Ck + i, Xttn-k >Cn\X U ..., X k }, 



(-*) 



i = l,...,n-k,withxl n %<---< 
of (Xk+i, ■ ■ ■ ,X n ). 
For i.i.d. X^s, 



(-fc) 

n—k:n—k 



being the ordered components 



Pr\ \j(X i:n <a) 



< i=k 



n-l 



[Fi(c n ,)] fe + ( ? J X] Pr{Xi- k+ l:n-k > <H+1, ■ ■ .,X n -k:n-k > C n } 



i=k 



Without any knowledge about the dependence structure of the X^s, ex- 
cept that the /cth-order joint distributions are all identical, one can obtain 
an upper bound to the probability Pr{lJ" =fc (Xj :n < a)} in terms of F k (x), 
the common c.d.f. of the maximum of any k of the X^s, as follows: 



n 



Pr{ \J(x itn < Ci ) 



<i=k 

= a 



- 1 X Pr{X k:J <c n } 
Jec fe 



n-l 



■ £ ^{a^HXM < a) - a-^I(X k:J < c l+1 )} 

JgCfc i=k 



n-l 



(2.2) 



+ £ ^[{l-^-WXj)} 

JGCfe i=k 



{a-^ x I{X k ,j < ci +1 ) - a^UXkj < a)}} 



n-l 



< Pk(cn) + (?) ^{a- 1 ^^) - ar + \F fc (c l+1 )} 



i=k 



n-l 



i=k 



F k (c k )+ a-^Fki^-F^x)} 

i=k+l 
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This further generalizes Lemma 3.1 in [12] that strengthens a similar in- 
equality in [9]. 

Remark 2.3. Considering a = c k , i = k, . . . , n, in Lemma 2.1, we get the 
following 

Pr{X k ., n < c k } 

= a- 1 Pr{X k:J < c k } 

Jec fc 

n-l 

(2.3) + E E E[Pr{X i:JC > c\Xj}{I(X k:J < c^a" 1 - a^]}] 

J&Ck i=k 

n-l 

< a' 1 E Pr{X k: j < c k } + E Pr{*k:J < c k } E K 7 ' " %+\] 
JGCk J&Ck i=k 

= E Pr{X k ,j < c k }, 
Jec k 

a generalized version of the Bonferroni inequality that, of course, can be 
proved directly using elementary probability theory. 

2.2. MTP2 and related results. Let X = IliLi ^ be a product of totally 
ordered spaces Xi, i = 1, . . . ,n, with the partial ordering defined as follows: 
for any x, y G X, we write x < y if x = [x\ , . . . , x n ) and y = (yi , . . . , y n ) sat- 
isfy Xi < yi in Xi for % = 1, . . . ,n. Let x Vy = (max(xi, yi), max(x n , y n )), 
and x Ay = (min(xi,yi),...,min(2; n ,y n )). 

Definition 2.1. A function [0,oo) is said to be MTP 2 (TP 2 

when n = 2) if for all x, y G X , 

/(xVy)/(xAy)>/(x)/(y). 

An n-dimensional random vector X = (X%, . . . ,X n ) or its distribution is 
called MTP 2 if its density is MTP 2 . 

The following results related to MTP 2 , which can be found in Karlin and 
Rinott [10], will be used in the next section. 

Result 2.1. Let /(x) be MTP 2 in X and g(xi), . . . ,g(x n ) be all increas- 
ing (or decreasing) in Xi, . . . , X n , respectively. Then, f(g(xi), . . . ,g(x n )) is 
MTP 2 in X. 

Result 2.2. If /i(x) and / 2 (x) are both MTP 2 in X, then /i(x)/ 2 (x) 
is MTP 2 in X. 
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Result 2.3. If /(x) is MTP 2 in X, then 




/(x) dxi 



i=m+l 



is MTP 2 in YlZiXi- 

Result 2.4. Let X = (X 1: . . .,X n ) be an MTP 2 random vector. Then, 
for any increasing (or decreasing) function <p on TZ , 1 < k < n, we have that 
E{<p(X)\X k+1 — %k+ii ■ ■ ■ i X n — x Tl \ is increasing (or decreasing) in each of 

Result 2.5. Let X = {X\,. . . ,X n ) be an MTP 2 random vector, and (p 
and tp be both increasing (or decreasing) on lZ n . Then 



Remark 2.4. Result 2.1 says that p-values corresponding to MTP 2 test 
statistics are also MTP 2 as long as they are defined in the same manner, 
each based on either a right-tailed or a left-tailed test, and that Results 
2.2-2.5 can be equivalently stated in terms of p-values. 

2.3. Examples. Karlin and Rinott [10] gave a list of distributions, many 
of which arise in multiple testing, that satisfy the MTP 2 condition under the 
null hypotheses. We will, however, consider a subclass of these distributions 
that satisfy the other condition in this article, namely, the identical feth- 
order joint null distributions. We will describe these distributions in terms 
of test statistics, which are X\, . . . , X n . 

Let Xi, . . . ,X n be continuous random variables that, under the null hy- 
potheses, are i.i.d. conditionally given a random variable Y, with Xi\Y = 
y ~ f(xi,y), i = 1, . . . ,n, where f(xi,y) is TP 2 in (xi,y), and Y ~ g(y). The 
joint density of X±, . . . ,X n under the null hypotheses, which is of the form 



is MTP 2 (follows from Results 2.2 and 2.3). Let F{x) be the common 
marginal (unconditional) c.d.f. of Xi under the null hypotheses. Then, as- 
suming that a right-tailed test based on Xi is used for testing the corre- 
sponding null hypothesis, the joint density of the p- values Pj = 1 — F(Xi), 
i = 1, . . . ,n, can also be expressed in the form (2.4). Distributions like these 
arise often in multiple testing [15, 18]. For instance, the equicorrelated and 
the absolute- valued equicorrelated standard multivariate normals that arise 
in many-to-one comparisons in a balanced one-way layout, and certain types 
of multivariate t, F and gamma distributions. 



£MX)V(X)} >£MX)}£{V(X)}. 



(2.4) 




i=i 
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The distribution function Gk of the maximum of any k of the n p- values 
for the model in (2.4) is given by the following: 



with F(x,y) being the common conditional c.d.f. of Xi given Y = y. In par- 
ticular, for the equicorrelated standard multivariate normal with the corre- 
lation p > 0, it is given by 



where $ and <ft are the c.d.f. and p.d.f. of N(0, 1), respectively. The following 
lemma will be useful in understanding the behavior of the critical values of 
our proposed tests with respect to p for this distribution. 

Lemma 2.2. For the distribution function in (2.5), dGk(u; p)/dp > 0, 
p>0. 

Although a more general result than this lemma is available in [20] , we will 
give a more direct and simple proof of this in the Appendix. It is easy to see 
from this lemma that the ath quantile, say q a {p), that satisfies Gk(q; p) = ot 
is a decreasing function of p > 0. 

3. Generalized Simes' test. Consider testing the intersection null hy- 
pothesis Hq = HiLi H{. The original Simes' test rejects Hq if Pj :n <ia/n for 
at least one i = 1, . . . , n. It corresponds to a stepup simultaneous test with a 
weak control of the FWER that rejects for all i <io and accepts H^ for 
all i > iq, where iq = maxi<j< n {i : Pj :n < ia/n}, if the maximum exists, oth- 
erwise, accepts all Huy In this section, we will first obtain a generalization 
of Simes' test, then present the results of a numerical study investigating its 
power performance. 

3.1. The test. We generalize the above stepup procedure to one that 
controls the fc-FWER weakly. The corresponding global test will be our 
generalized Simes' test. In other words, we consider rejecting Hq if and only 
if Pim < Q max(j,fc) f° r a t least one i = 1, . . . , n, where the constants ctfc < • • • < 
a n are such that the probability of k or more rejections of the component null 
hypotheses under Hq, that is -fV# {lJ" =fc (Pj. n < c^)}, is bounded above by 
a. Toward finding these critical values, we first have the following theorem. 

Theorem 3.1. Let (Pi,...,P n ) have an MTP2 distribution. Then, for 
any fixed < < ■ ■ ■ < a n < 1 and 1 < k < n, we have 




< u < 1 



(2.5) 




(3.1) 
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if a^ 1 Pr(P k: j < OLi) is nondecreasing in i = k, . . . ,n, for all J £ C k . 

Proof. The theorem follows from Lemma 2.1 (in terms of p- values) if 
we can show that 

(3.2) E[ip i _ k+1 (P J ){a^ 1 I(P k ..j < a % ) - a^Ifaj < a i+1 )}} < 0, 

for all J £ C k and i = k, . . . ,n — 1, under the assumed conditions. Clearly, 
this is true under the independence. To prove this under dependence, first 
consider any fixed J £ C k and i in (2.1). Let /(pj) be the density of P j. 
Then, since ol% < cxj+i, we can express the left-hand side of (3.2) as 

Pr{P k:J < a l+ i}£;[^_ fc+1 (Pj)^(Pj)], 

with the expectation taken with respect to the random vector Pj having 
the following density at pj: 

( 3 - 3 9i{PJ) = p ,p t— 

Pr{Pk-.j <a i+ i\ 

and 

fa(Pj) = ar l I(P k ,j < on) - a- + \. 

Since {Pi.jc > a^+j, . . . ,P n -k:j c > ««} is increasing in Pjc, we see from 
Result 2.4 that i/)j_fc + i(Pj) is increasing in Pj. The function ^(Pj) is 
decreasing in P j. As both f(j>j) and I(jpk-.j < «i+i) are MTP2, Result 2.2 
says that the density in (3.3) is also MTP2; that is, Pj is MTP2. Therefore, 
from Result 2.5, we have that the expectation in the left-hand side of (3.2) 
is less than or equal to 

Pr{Pk-.j < a i+ i}E[^_ fc+1 (P J )] J B[0 i (Pj)] 

= E[^ k+l {Pj)][aT l Pr{P k .,j < on} - a-^Pr{P k ,j < a i+1 }], 

which is less than or equal to zero if a^ 1 Pr(P k: j < q) is nondecreasing in i. 
This proves the theorem. □ 



Remark 3.1. Theorem 3.1 is a generalized version of Simes' inequality. 
It reduces to Simes' inequality when k = 1 [15, 18, 19]. It is important to 
note, however, that Simes' inequality actually holds for a slightly wider class 
of positively dependent multivariate distributions. Consider a class of dis- 
tributions of X satisfying the condition: E{tp(X(~ l ^)\Xi = x{\ is increasing 
(or decreasing) in x% for all % £ {1, . . . ,n}, for any increasing (or decreasing) 
function <p on TZ n ~ l , where X^) = {Xj,j £ {1, . . . ,n} — {i}}. This, referred 
to as the positive regression dependence on subset (PRDS) condition in 
[1], defines a wider class than those satisfying the MTP2 condition (see Re- 
sult 2.4). In fact, a multivariate normal with nonnegative correlations, which 
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may not be MTP2 unless it's covariance matrix has an inverse with nonpos- 
itive diagonals, belongs to this larger class. The Simes' inequality holds for 
this larger class of PRDS distributions; see, for example, [1, 16] from which 
this result also follows. The generalized Simes' inequality in Theorem 3.1, 
however, requires the stronger MTP2 condition. In fact, a careful study of 
its proof reveals that, while just Result 2.4 will suffice when k = 1, which is 
the PRDS condition, when k > 1, we need both Results 2.4 and 2.5, thereby 
forcing us to consider the MTP2 condition. 

Now, let the p-values be MTP2 with identical /cth-order joint distribu- 
tions under Hq; for example, consider a situation when they are generated 
from test statistics whose joint density is of the form (2.4) under Hq. Let 
Gk(u) = Prj^ {maxj G jPj < u}, < u < 1, for all J G Ck- Then, the proba- 
bility in the left-hand side of (3.1) under Hq is less than or equal to Gk(a n ) 
if a^ 1 Gk(ai) = a~ 1 G/ c (a n ), i = k, . . . ,n. Thus, this probability is less than 
or equal to a if the ctj's are chosen subject to: 

(3.4) G k (ai) = —a = — — —ret, i = k,...,n. 

a n n{n — 1) ■ ■ ■ (n — k + 1) 

When the Pi's are i.i.d. as uniform on (0, 1) under Hq, these ctj's provide an 
exact value of a for this probability. Thus, we have the following: 

Proposition 3.1. Let a k < • • • < a n be defined as in (3.4). The gener- 
alized Simes' test that rejects Hq if and only if Pi- n < a max (i,/c); f or a t least 
one i = 1, ... ,n, controls the k-FWER at a, exactly when the Pi 's are i.i.d. 
under Hq and conservatively when they are MTP2 with a common kth-order 
joint distribution under Hq. 

Remark 3.2. The first k — 1 critical values in the generalized Simes' 
test can be chosen arbitrarily without affecting control of the A:-FWER. In 
particular, one may choose these to be zero and consider rejecting Hq if and 
only if Pi- n < ai, for at least one i = k, . . . ,n. However, even though we are 
not much interested in the power of our test at an alternative where between 
1 to k — 1 of the component null hypotheses are false, we want to keep it at 
a high value by choosing these critical values as large as possible. Of course, 
it would be counterintuitive if we select them in a way that will make the 
aj's nonmonotone. Thus, the best choice for these is to make them all equal 

to Oik- 

One could use the original Simes' test or come up with a simple gener- 
alization of it with ai = m&x(i,k)a/n, % = 1, ...,n. These will also control 
the fe-FWER, under the weaker PRDS condition of the p-values, because 
of the original Simes' inequality (see Remark 3.1). But, they do not take 
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the full advantage of the notion of fc-FWER, as their critical values are not 
determined by directly controlling it. Our generalized Simes' test, on the 
other hand, directly controls the k-F WER, and does so in the least conser- 
vative manner in the sense that its fc-FWER becomes exactly a under the 
independence case. 

When the p- values are independent, the a^s in (3.4) are given by 




(3.5) ai= all ' i = k,...,n. 



If 2 < k < 1/a, we have 

i — k + j j 1 in — k + j)a 

(3.6 ->T>T> a >~ — , 

i k k n 

for each l<j<k<i<n, implying that 
(3.7) at= a- TT r— > -a, i = k, 

I nrt J- A. /y-i _L_ 1 I n 



n 1L ,n — k+jl n 

.7 = 1 / 



,n. 



Thus, our generalized Simes' test (with 2 < k < 1/a) is more powerful to 
detect k or more false null hypotheses than the original Simes' or its simple 
generalization mentioned above when the p- values are independent. 

To see the extent of power improvement we get by generalizing Simes' 
test in a particular testing situation, we did a numerical study involving 
dependent normals whose findings are discussed in the next subsection. As 
we are mainly interested in the probability of detecting k or more false 
null hypotheses, we consider it as the definition of power, and examine how 
our procedure performs in terms of this compared to Simes' test when the 
number of false null hypotheses is actually k or more. 



3.2. Numerical results. We consider a multiple testing situation where 
the underlying test statistics, Xi ~ iV(/_ij,l), i = l,...,n, are jointly dis- 
tributed as multivariate normal with a known nonnegative common cor- 
relation p and the problem is that of testing -ffo : HiLiI/^ = 0} against 
Hi ■ UiLil^i > 0}. The generalized Simes' test is applicable in this situa- 
tion (see Section 2.3). 

With n = 10 and a = 0.05, we numerically computed the critical values 
of the generalized Simes' test using (1.1)— (1.2). These are listed in Table 1, 
1 for p = 0,0.25,0.50 and 0.75, and k = 1 (original Simes'), 2 and 3. We 
then simulated data to do the power analysis for each of the generalized and 
original Simes' tests. Each simulated power for the generalized Simes' (or 
Simes') test was obtained by (i) generating ten dependent normal random 
variables N(pi, 1), £ = !,..., 10, with a common correlation p and with ni 
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Fig. 1. Comparison of powers of the generalized and the original Simes'' tests for testing 
equicorrelated multivariate normal means with a = 0.05. 



of the ten /Vs being equal to 2 and the rest 0, (ii) applying the generalized 
Simes' (or Simes') test to the generated data using the critical values in 
Table 1 and (iii) repeating steps (i) and (ii) 50,000 times before observing 
the proportion of times at least k of the ten component null hypotheses in 
Hq were rejected. The power comparisons are presented in Figure 1, with 
the four panels in the bottom row representing the power graphs for k = 2, 
p = 0, 0.25, 0.50 and 0.75, and those in the top row representing the graphs 
for k = 3 and the same values of p. 

The generalized Simes' test is significantly more powerful than the original 
Simes' in detecting k or more false null hypotheses when the p- values are 
independent or weakly but positively dependent. As the p-values become 
more and more positively dependent, the generalized Simes' test unfortu- 
nately loses its edge over Simes' test. This phenomenon is, however, not 
surprising, as we can see from the remark following Lemma 2.2 that as p 
increases to 1, the critical values of the generalized Simes' test decreases 
(see Table 1) and eventually become smaller than the corresponding criti- 
cal values of the original Simes' test, thereby making it more conservative. 
We repeated this numerical study with n = 20 and generated similar power 
graphs. Those graphs, not presented in the paper, provided almost same 
pictures as in Figure 1. 

Since we do not seek to control the probability of the number of false 
rejections being between 1 to k — 1 under the global null hypothesis in our 
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generalized Simes' test, one would be curious to see how high this probability 
could be in a particular situation. So, we did some additional calculations 
in the above simulation studies and computed these probabilities. These are 
given in Table 2. As we see from this table, this probability is not excessively 
large. At the maximum, it could be 50% when the p- values are independent, 
but, most often, when the p-values are dependent, it is reasonably low. 

4. Generalized Hochberg's procedure. For simultaneous testing of Hi, 
i = 1, . . . ,n, Hochberg's [5] procedure rejects for i <io and accept 
for i > iq, where i$ = maxi<j< n {i : Pj :n < oti\ with ojj = a/(n — i + 1), i = 
1, . . . ,n. We will generalize this in the following subsection. Later, we will 
discuss the results of a numerical study comparing the performance of this 
generalized procedure with other related procedures. 

4.1. The procedure. Hochberg's procedure is the stepup version of Holm's 
[7] stepdown procedure, and was initially shown in [5] to control the FWER 
under the independence of the p- values using the original Simes' inequality. 
The papers [15, 18] later established the FWER control of this procedure 
under positive dependence. We will generalize these results in this paper 
in terms of the £;-FWER and using the /cth-order joint null distributions, 
assumed common, of the p-values. The resulting A;-FWER procedure is our 
proposed generalized Hochberg's procedure. 

Toward generalizing Hochberg's procedure, we first modify the generalized 
stepdown procedure in [12] controlling the fc-FWER using the /cth-order joint 
null distributions of the p-values, again assumed to be common, and then 
show that the critical values of this modified generalized stepdown procedure 
can be used in a stepup procedure that will also control the /c-FWER, under 
some additional assumptions on the dependence of the p- values. 

In order to modify the stepdown procedure in [12], we obtain a generalized 
version of the usual Bonferroni procedure and then develop its Holm type 
stepdown improvement. This is given in the following theorem. 

Table 2 

Probabilities of falsely rejecting between 1 to k~ 1 of the 
component null hypotheses in the generalized Simes' test 
with a = 0.05 
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0.0783 
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0.0012 
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0.4619 


0.1180 


0.0182 


0.0003 
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Theorem 4.1. Let (P\, . . . ,P n ) have identical kth- order joint distribu- 
tions, with Gt being the c.d.f. of the maximum of any k of them, under 
the null hypotheses. Let a k < • • • < a n be defined as Gk{oti) = a/a n+ k-i,i = 
k,. .. ,n. 

(i) The single-step procedure that rejects any Hi for which Pi < a k con- 
trols the k-FWER. 

(ii) The generalized stepdown procedure that accepts any for which 
i > mini<j< n {j : Pj :n > a max (j,fc)} the minimum exists, otherwise rejects 
all H(i), controls the k-FWER at a. 

Proof. Let no be the number of true null hypotheses. If < no < k, 
then, for any procedure, the /c-FWER is zero and hence trivially controlled. 
So, we assume that k<nQ<n and that the first no of the n Pj's correspond 
to the true null hypotheses, with Pi- no < • • • < Pn :n being their ordered 
versions. Then, we get from (2.3) that at least k of these true null hypothe- 
ses will be rejected by the single-step procedure in (i) with the following 
probability 

Pr{P k -.n < < ( n fc °) Pr{™x Pi < = (7) ^ «■ 

This proves (i). 

To prove (ii), we can argue as in the proof of Theorem 2.2 in [12] to 
claim that if the generalized stepdown procedure rejects at least k of the 
first no hypotheses then Pk :no < Oj 5 where i = k, . . . ,n — no + k. Thus, since 
cti < a n _„, +fc, the fc-FWER of this procedure is less than or equal to 

Pr{Pk:n < "n-no+fc} < ( ? 1° ) G k (a n _ no+k ) = = a. 



\ k J ^n-no+Kj ^no^ — ^ 

Remark 4.1. The stepdown procedure in Theorem 4.1 is our general- 
ized version of Holm's procedure, which is different from that in [12]. When 
the p-values are independent, the critical values of our generalized stepdown 
procedure are given by: 

/ k . \i/fc 

(4.1) ch= aTT — - , i = k,...,n; 

\ j= 1 n -' l +jj 

whereas, those in [12] are given by: 

ka 

(4.2) 04 = — — , i = k,...,n. 

n — i + k 

Note that if < a < 1/k, we have 

> —T, J = l,...,k-1, 



n — i + j n — i + k' 
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for each fixed i = k, . . . , n, implying that a.{ in (4.1) is greater than or equal 
to the corresponding a, in (4.2) if 1 < k < 1/a. Thus, when the p- values are 
independent, our generalized stepdown procedure (with 1 < k < 1/a) is a 
more powerful fc-FWER procedure than the one in [12]. 

Of course, with independent p- values, alternative fc-FWER procedure im- 
proving the Lehmann-Romano procedure can be obtained. The following is 
such a procedure that Joseph Romano pointed out in a personal communi- 
cation. Consider the generalized stepdown procedure with the critical values 
a-i = H^ n _ i+k {a), i = k,...,n, where 



j=k 

the c.d.f. of the /cth-order statistic based on n i.i.d. U(0, 1). However, we will 
not make any attempt to compare it with the one based on (4.1). 

Next, we will show that the stepup version of our generalized Holm's step- 
down procedure also controls the /c-FWER with some additional dependence 
condition of the p- values under the null hypotheses. 

Theorem 4.2. Let (Pi,...,P n ) have an MTP 2 distribution in addi- 
tion to having identical kth- order joint distributions under the null hypothe- 
ses. Consider the generalized stepup procedure based on the critical val- 
ues ctk < ■ ■ ■ < a n in Theorem 4.1, that is, reject any Hn\ for which i < 
maxi<j< n {j : P J:n < o max (j fc )} if the maximum exists, otherwise accept all 
H(i)- This procedure controls the k-FWER at a. 

Proof. Again, let us assume without any loss of generality that k < 
no <n and that the first uq of the n Pi's correspond to the true null hy- 
potheses, with Pi:n < • • • < Pn :n being their ordered versions. Then, we 
get from [13] that the k-F WER of the generalized stepup procedure in the 
theorem is less than or equal to 

(4.3) Pr\\J(Pi:n a <a n . no+ A. 



Since 



we have 



no-k+j i-k+j . 

r— < : , j = l,...,k, 

n o — i + 3 3 



-j-j- np-k + j < jj- i 



a-no+k-i £=i n ° ~ 1 + 3 j=i 



SOME GENERALIZED FWER PROCEDURES 



19 



implying that 

"k {(^n—no+i 



n t \- a ^ aia 

Lrfc ttn-nn+i — - • 



Q>riQ+k— i 0"no 

for i = k, . . . ,n. In other words, the probability in (4.3) is less than or equal 
to 

(4-4) Pr|u(P i:no <a*)|, 

where Gk(a*) = aja/a no , i = k, . . . ,n. The theorem then follows by noting 
that the probability in (4.4) is bound above by a because of the generalized 
Simes' inequality in Theorem 3.1. □ 

Remark 4.2. The stepup procedure in Theorem 4.2 is our generalized 
Hochberg's procedure. One could obtain a different version of it by using 
the critical values of the generalized Holm's procedure in [12]. It would also 
control the fe-FWER, of course, under the additional condition that the 
p- values are positively dependent (recall Remark 3.1). This is because the 
/c-FWER of this procedure, which is bounded above by 



(4.5) 



"0 



ia 



< Pr\ U ( ^.no < n 



is less than or equal to a due to the original Simes' inequality. While as a 
/c-FWER procedure, it is always more powerful than the original Hochberg's 
procedure, ours is even more powerful when the p- values are independent. 

A question that naturally arises in trying to develop A;-FWER controlling 
stepup procedures is: Does the stepup procedure based on our generalized 
Simes' critical values control the fe-FWER? The answer is no, which can 
be proved, as in [8] for the case of k = l, considering independent p- values 
and a configuration of true and false null hypotheses for which the p- values 
corresponding to the false null hypotheses are very close to zero. 

We now report the findings of another numerical study that we conducted 
to examine the power performance of our generalized Hochberg's procedure 
with those of the Lehmann-Romano version of the generalized Hochberg's 
procedure and the original Hochberg's procedure. As a measure of power, 
we consider the average power (AvePower), the expected proportion of false 
null hypotheses that are correctly rejected, which is commonly used in si- 
multaneous testing; see, for example, [2]. 
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Fig. 2. Comparison of average powers of the proposed generalized Hochberg' s proce- 
dure, Lehmann-Romano version of the generalized Hochberg' s procedure and the original 
Hochberg's procedure for testing equicorrelated multivariate normal means with a = 0.05. 



4.2. Numerical results. We generated 100 dependent normal random 
variables iV(/z-,l), i= 1,...,100, with a common correlation p, where n\ 
of these 100 /Vs are all equal to 2 and the rest are all equal to 0. We then 
applied to this data set each of the aforementioned procedures to test if 
each of these means is either true (m = 0) or false (//- > 0), and noted what 
proportion of the m means that are all equal to 2 were correctly declared as 
false. We repeated this experiment 20,000 times and obtained the average 
of these proportions to obtain the simulated AvePower for each procedure. 
Figure 2 compares the average powers of these procedures, the five panels in 
the bottom row presenting this comparison for k = 2, p = 0,0.10,0.25,0.50 
and 0.75, and those in the top row presenting it for k = 3 and the same 
values of p. 

The proposed generalization of Hochberg's procedure is seen to be signifi- 
cantly more powerful compared to any of the other two procedures when the 
p-values are independent. With increasing positive dependence among the 
p-values, our procedure, however, loses its power, which is consistent with 
the similar behavior of the generalized Simes' to which it is related. 

We carried out a similar simulation study comparing our stepdown pro- 
cedure with Lehmann-Romano 's but did not include those graphs here as 
they have produced the same comparative picture as Figure 2. 
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5. An application. We consider the data set in [14] that consists of all 
hedge funds in the CISDM (Center for International Securities and Deriva- 
tives Markets) database having complete return history from 01/1992 to 
03/2004. There are 105 such funds each having 147 monthly returns that 
are net of management and incentive fees. The problem we consider is that 
of identifying financial strategies, if there are any, that outperform a bench- 
mark. As it is standard in the hedge fund industry, the benchmark chosen is 
the risk free rate of return (3- month T-bills). The random variable of inter- 
est is the difference in log returns between a particular fund and the T-bill. 
A weak positive dependence among the funds in terms of this variable is 
assumed observing that the 5,460 sample correlations have an average of 
0.27. The returns for hedge funds are auto correlated, see [14]. Therefore, we 
fitted an AR(1) model with a constant term to the log-return differences for 
each fund and tested if that constant is significantly different from zero or 
not. The p- value was computed for each fund before applying our general- 
ized Hochberg's and the stepup version of the Lehmann-Romano fc-FWER 
procedures at a = 0.05 for k = 1, 2 and 3, p = 0.00, 0.10 and 0.25. Recall that 
when k = 1 both our and the Lehmann-Romano procedures are same as the 
original Hochberg's procedure, and that the Lehmann-Romano procedure 
does not depend on p. Table 3 shows the number of funds that are declared 
significantly outperforming the benchmark according to these procedures. 

6. Concluding remarks and additional numerical investigations. An at- 
tempt has been made in the article to utilize fcth-order joint null distribu- 
tions of the p- values in order to improve fe-FWER procedures for a global 
or simultaneous testing that are based only on the marginal p-values. The 
underlying idea seems intuitively reasonable when the joint null distribution 
of the p-values are known and computationally feasible with k not being 
excessively large. Nevertheless, while we have been quite successful in our 
attempt when the p-values are independent or weakly but positively depen- 
dent, the idea does not appear to work that well when there is a strong 
positive dependence among the p-values. The main reason, of course, is that 
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Number of funds declared outperforming the benchmark at 
a = 0.05 
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23 
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the critical values become smaller with increasing dependence among the 
p- values, making the corresponding procedure more and more conservative. 
It is important to note, however, that we cannot just use the critical values 
corresponding to the independence case even when the p-values are depen- 
dent. This will not control the fc-FWER. Results in this paper have been 
used very recently in [4, 17]. 

Although we have assumed throughout this article that the p-values are 
marginally distributed as uniform on (0,1) under the respective null hy- 
potheses, we could relax this when the p-values are independent. In Simes' 
test or its proposed generalization, we could assume the p-values to be 
stochastically larger than £1(0,1), that is, Prn^Pi < u} < u, under the 
null hypotheses when they are independent. The fc-FWER would still be 
bounded above by a [which can be checked from the first equality in (A. 4)]. 
The same assumption can be made in the independence case for the general- 
ized stepwise procedures considered in this article. While in our generalized 
single-step and stepdown procedures we have assumed only the availabil- 
ity of the /cth-order joint null distributions of the p-values, in our proposed 
generalized stepup procedure we have made some additional assumption on 
the dependence structure of the p-values. We could actually forgo this ad- 
ditional dependence assumption and develop alternative stepup procedures 
that control the fc-FWER, as in [12, 13], by making use of the inequality 



The procedures proposed in this article are heavily dependent on the 
MTP2 and "identical /cth-order joint null distributions" assumptions. What 
if one or both of these assumptions are violated? Considering first the gen- 
eralized Simes' test, we did some numerical calculations to see how it can 
be handled in a situation where the p-values are MTP2 but do not have 
identical /cth-order joint null distributions. In particular, we considered the 
scenario where X{ ~ A(/ij, 1), i = 1, . . . ,n, are jointly multivariate normal 
with correlations pij = XiXj, for some < Xi,Xj < 1, i, j = 1, . . . , n, with 
Hq : f]™ =1 {fii = 0} and #1 : [J" =1 {//j > 0}. This is the situation that occurs 
in the many-to-one comparison problem in unbalanced one-way setup [6]. 
The AYs are MTP2, as the off-diagonal entries of the inverse of this corre- 
lation matrix are all negative; see, for example, [10], but now they do not 
have identical fcth-order joint null distributions. We modified the generalized 
Simes' test in this case by considering 



in place of to determine the critical values. We numerically computed 
its fc-FWER (weak) and the power, the probability of correctly rejecting k 
or more of the component null hypotheses, and compared them to those of 



(2.2). 
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the original Simes' test with k = 2, n = 20, Aj = \/0.25 for i = 1, . . . , 10, and 
= V0?75, for i = 11,..., 20. Fi gure 3 shows this power comparison for differ- 
ent values of m, where it is assumed that /ij = for i = 1, . . . , no = n — m, 
and = 2 for the other ni z's. When ni = 0, that is, under Hq the pow- 
ers are 0.02690 and 0.02060 for this modified generalized Simes' and the 
original Simes' tests, respectively, indicating that the modified generalized 
Simes' test controls the 2-FWER, slightly better than the original Simes' 
test. Moreover, the modified generalized Simes' test performs reasonably 
well in terms of power compared to the original Simes' test. The simulated 
powers were based on 20,000 iterations. So, the idea of averaging out the 
Gk to modify the generalized Simes' test when the p-values are MTP2 but 
do not have identical /cth-order joint null distributions seems to work in the 
present scenario. The same would also work for the corresponding general- 
ized Hochberg's procedure. Proving these analytically for a general scenario 
of this type would be an interesting and challenging undertaking. 

We also numerically investigate how the proposed procedures would per- 
form if the MTP2 condition is violated. For instance, suppose that we have 
a central multivariate t corresponding to an equicorrelated standard mul- 
tivariate normal with nonnegative correlations. The MTP2 condition does 
not hold here, although the PRDS does; see, for example, [1, 16]. We ran a 
simulation study for this distribution based on 20,000 iterations considering 




Fig. 3. Comparison of powers of the modified generalized and the original Simes' tests 
for testing multivariate normal means when pij = A; Aj , where \i = \/0.25 for i = 1, . . . , 10, 
= VUT75 for i = 11, ... , 20, k = 2 anda = 0.05. 
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p = 0.25. Figure 4 shows the performance of the generalized Simes' test over 
the original Simes' for different values of the degrees of freedom, it seems 
that the generalized Simes' test, and hence, the generalized Hochberg's pro- 
cedure, would still work in this case. 

We have shown theoretically that our proposed fc-FWER procedures with 
k < 1/a are more powerful than the corresponding existing procedures when 
the p-values are independent and given empirical evidence that this power 
improvement is still maintained when the p-values are weakly dependent. 
What happens when k > 1/a, as in high-dimensional settings where the 
number of tests might be in the thousands and one might want to set k at a 
large value? We numerically investigated this question only for the general- 
ized Hochberg's procedure, expecting that similar conclusion can be drawn 
from this study regarding the generalized Simes' test. We ran another simu- 
lation study extending that in Section 4.2 from 100 to 1,000 tests and recom- 
puting AvePower's for all these three procedures based on 20,000 iterations 
for values of k much larger than 3. We, however, considered p = 0, since 
the critical values become increasingly difficult to calculate as k becomes 
larger when p > and we expect, as for smaller values of k, that the per- 
formance of our procedure for weakly dependent p- values would not change 
much from the situation when p = 0. Figure 5 shows the power performance 
of the generalized Hochberg's procedure in these situations. Comparing this 




Fig. 4. Comparison of powers of the generalized and original Simes' tests for multivariate 
t corresponding to equicorrelated multivariate normal with p — 0.25, k = 2 and a — 0.05. 
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Fig. 5. Comparison of average powers of the proposed generalized Hochberg 's proce- 
dure, Lehmann-Romano version of the generalized Hochberg' s procedure and the original 
Hochberg' s procedure for testing independent normal means with a = 0.05. 



with Figure 2, we see that the performance of our procedure actually gets 
better when k > 1/a. 

APPENDIX 

A.l. Proof of Lemma 2.1. First, we have 



(A.l) 



Prl \J(X tn <Ci 



Pr{Xi :n < Ci, Xi+i- n > Cj + i, . . . , X n - n > c n ,}. 



i=k 

For each i = k, . . . ,n, 

(A.2) Pr{X i:n <Ci,X i+ i :n >c i+ i,...,X n:n >c n }= ^ Pr{Aj*}, 

J*ec t 



where 



7* = \ ] 



Aj* = |maxAj <Ci,X 1:J *c > c i+1 , . . . , A n _ i: j. c > c n }. 
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Since for J £ and J* £ with k<i, 



{maxX,< Q }n^ = {^ 



J A/*, ifJCJ*, 
otherwise, 



we see that 

i 
k 



EM4/*}=E E MA/*} 
J* eCi J* GCi Jec k -.jcj* 



= E E ^({maxX^JnA, 
(A.3) 

= E Pr i X k-J < ^i-fc:J c < Cj, 

Jec fc 

-Aj_fc+l : Jc > Cj+1) • • • > A n _fc : Jc > C n }- 

Thus, we get from (A.l), (A.2) and (A.3) that 
Pr^\J(X i:n < Ci )^ 

n 

= E ^2 a 7 1Pr i X k:J < Ci,Xi_ k: Jc < a, 
J^Ck i=k 

Xi-k+l:J c > Cj+lj • • • ,X n _k:J c > C n } 

= E \a- 1 E[{l-i; n - k (Xj)}I(X k: j<c n )] 

n-1 

(A.4) + E J B[^-fe+i(Xj)a- 1 /(X fc:J < a)] 



i=k 
n-1 



i=k+l ) 

= a" 1 E P r {Xk j < c n } 

J&c k 

n-1 

+ E E^[v^fc+i(Xj)K rl A^ : j<c 4 ) 

JeCfc i=fc 

- a^VfA^j < Cj+i)}]. 

This proves the lemma. 
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A.2. Proof of Lemma 2.2. Writing Gk as a function of A = p 1 / 2 and 
letting <E> -1 (1 — u) = t, we have 



(A.5) 



Since 



1_$ 



t-Xy 



i fc-i 



3 



-<I> 



t-Xy 



dX \^T^>? 



±*( tz* 

dX \^T^V 



(y - xt) 



( y-xt 



(i-A 2 ) v / r^A 2 VvT^ 

making the transformation y = vy/l — X 2 + Xt in (A.5), we get 



4>(y) dy. 



<Kt), 



k(f>(t) 

> k<t>(t) 



[1 - $(i\/l-A 2 - Av)] fc_ V(w) cfo 



[1 - *(tVl — A 2 — At/)]*~V(i>) <fo x / v^(v)dv 



1-A 2 
= 0, 

as 1 — — A 2 — Au) is increasing in v. 
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